WHAT IS CLAIMED IS: 



\ 1. In a data processing system, an apparatus 

comprising: 

\ identity means for determining, for any of a 
pluraAty of data items in the system, a substantially 
unique Identifier, said identifier depending on all of 
the datAin the data item and only on the data in the 

data item and 

Existence means for determining whether a 
particularViata item is present in the system, by 
examining the identifiers of the plurality of data items. 

2. \ An apparatus as in claim 1, further 

comprising: \ 

locaA existence means for determining whether 
an instance of a particular data item is present at a 
particular location in the system, based on the 
identifier of the\data item, 

3. An/abparatus as in claim 2, wherein each 
location contains \^_Wstinct plurality of data items, and 
wherein said local existence means determines whether a 
particular data item Ifc present at a particular location 
in the system by examining the identifiers of the 
plurality of data items ^t said particular location in 
the system. \ 

4. An apparatus as in claim 2, further 
comprising: \ 

data associating means for making and 
maintaining, for a £ata item Vn the system, an 
association between the data rtem and the identifier of 
the data item; and \ 

access means for accessing a particular data 
item using the identifier of the >data item. 
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\ 5. An apparatus as in claim 2, further 

comprising: 

\ duplication means for copying a data item from 
a source to a destination in the data processing system, 
by providing said destination with the data item only if 
it is determined using the data identifier that the data 
item is nofc present at the destination, 

6A An apparatus as in claim 4, further 

comprising: \ 

assimilation means for assimilating a new data 
item into the system, said assimilation means invoking 
said identity means to determine the identifier of the 
new data item an\i invoking said data associating means to 
associate the ne\A data item with its identifier. 

7. An apparatus as in claim 4, further 
comprising: /CN 

dup licati<pn\ means for duplicating a data item 
from a source locati>M^r€o a destination location in the 
data processing system A based on the identifier of the 
data item, said duplication means invoking said local 
existence means to deterrAne whether an instance of the 
data item is present at th€* destination location, and 
invoking said access means Vo provide said destination 
with the data item only if said local existence means 
determines that no instance oi£ the data item is present 
at the destination. \ 

8. An apparatus as in claim 7, further 
comprising: . _ \ 

backup means for making >copies of data items in 
the system, said backup means maintaining a backup record 
of identifiers of data items backed \up, and invoking 
duplication means to copy only thoseVdata items whose 
data identifiers are not recorded in \he backup record. 



78 



\ 9. An apparatus as in claim 8, further 

comprising: 

\ recovery means for retrieving a data item 
previously backed up by said backup means, based on the 
identiMer of the data item, said recovery means using 
the backup record to identify the data item, and invoking 
access means to retrieve the data item. 

Vo. An apparatus as in claim 2, wherein a 
location iA a computer among a network of computers, the 
apparatus further comprising: 

remote existence means for determining whether 
a data item is present at a remote location in the system 
from a current\ location in the system, based on the 
identifier of Ohe data item, said remote location using 
local existencevmeans at the remote location to determine 
whether the data\u:em is present at the remote location, 
and providing tnmcurrent location with an indication of 
the presence of traedata item at the remote location. 

11. An apparatus as in claim 4, wherein a 
location is a computer among a network of computers, the 
apparatus further coribrising: 

requesting means for requesting a data item at 
a current location in the system from a remote location 
in the system, based om the identifier of the data item, 
said remote location usxng access means at the remote 
location to obtain the dAta item and to send it to the 
current location if it is present. 

12. An apparatus\as in claim 1, further 
comprising: ^ \ 

context means for making and maintaining a 
context association between aA least one contextual name 
of a data item in the system and the identifier of the 
data item; and \ 



\ referencing means for obtaining the identifier 

of A data item in the system given a contextual name for 
the oata item, using said context association. 

\ 13. An apparatus as in claim 12, further 
comprising: 

Assignment means for assigning a data item to a 
contextual Viame, invoking said identity means to 
determine the identifier of the data item, and invoking 
said context Veans to make or modify the context 
association between the contextual name of the data item 
and the identifier of the data item. 

14. Ar\ apparatus as in claim 12, further 
comprising: \ 

data associating means for making and 
maintaining, for a dat^ item in the system, an 
association between /the data item and the identifier of 
the data item; \\y 

access means tor accessing a particular data 
item using the identified of the particular data item; 
and \ 

contextual name Access means for accessing a 
data item in the system for\a given context name of the 
data item, determining the da^a identifier associated 
with the given context name, *nd invoking said access 
means to access the data item using the data identifier. 

15. An apparatus as iX claim 11, further 
comprising: \ 

transparent access means \f or accessing a data 
item from one of several locations, Vising the identifier 
of the data item, said transparent aqce'ss means invoking 
said local existence means to determine if the particular 
data item is present at the current location, and, in the 
case when the particular data item is ndt present at the 
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current location, invoking said requesting means to 
obtain the data item from a remote location. 

\ 16. An apparatus as in claim 15, further 
comprising: 

\ identifier copy means for copying an identifier 
of a data item from a source location to a destination 
locationA 

£7. An apparatus as in claim 15, further 

comprising: \ 

context means for making and maintaining a 
context association between a contextual name of a data 
item in the system and the identifier of the data item; 

context copy means for copying a data item from 
a source location to a destination location, given the 
contextual name W the data item, by copying only the 
context association between the contextual identifier and 
the data identif ief^irom the source location to the 
destination location; and 

transparent/ referencing means for obtaining a 
data item from one or several locations the system given 
a contextual name foA the data item, said transparent 
referencing means invoking said context association to 
determine the data identifier of a data item given a 
contextual name, and invoking said transparent access 
means to access the data item from one of several 
locations given the identifier of the data item. 

18. An apparatus as in claim 1, wherein at 
least some of said data i perns are compound data items, 
each compound data j,tem inaluding at least some component 
data items in a fixed sequence, and wherein the identity 
means determines the identifier of a compound data item 
based on each component data Utem of the compound data 
item. \ 



81 



\ 19. An apparatus as in claim 18, wherein said 

compound data items are files and said component data 
items\are segments, and wherein the identity means 
detemnes the identifier of a file based on the 
identifier of each data segment of the file. 

\ 20. An apparatus as in claim 18, wherein said 
compound data items are directories and said component 
data items Nare files or subordinate directories, and 
wherein theVdentity means determines the identifier of a 
given directory based on each file and subordinate 
directory within the given directory. 

21. Vn apparatus as in claim 11, further 
comprising: \ 

means rtor advertising a data item from a 
location in the system to at least one other location in 
the system, said mdans for advertising providing each of 
said at least one oWfei location with the data identifier 
of the data item, ana providing the data item to only 
those locations of sa^Lck^other locations that request said 
data item in response to said providing. 

22. An apparatus as in claim 18, further 
comprising: \ 

local existence iieans for determining whether a 
particular data item is present at a particular location 
in the system, based on the identifier of the data item; 

and \ 

compound copy means Mr copying a. data item 
from a source to a destination rn the data processing 
system, said compound copy means Ninvoking said local 
existence means to determine whetWer the data item is 
present at the destination, and to Wermine, when the 
data item is a compound data item, wkether the component 
data items of the compound data item are present at the 
destination, and providing said destination with the data 
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itek only if said local existence means determines that 
the tiata item is not present at the destination, and 
providing said destination with each component data item 
only £f said local existence means determines that the 
component data item is not present at the destination. 

\23. An apparatus as in claim 11 , further 
comprising\ 

m\ans for verifying the integrity a data item 
obtained froW said requesting means in response to 
providing sa:m requesting with a particular data 
identifier, toy confirm that the data item obtained from 
the requesting Veans is the same data item as the data 
item requested, \said verifying means invoking said 
identity means to determine the data identifier of the 
obtained data iteapaand comparing said determined data 
identifier with slid particular data identifier to verify 
said obtained datJ^JLJ^m. 

24. An apparatus as in claim 2, wherein a 
location is at least one of a storage location and a 
processing location, and wherein a storage location is at 
least one of a data storage device and a data storage 
volume, and wherein a processing location is at least one 
of a data processor and a\ computer. 

25. An apparatuAas in claim 3, wherein at 
least some of said data itemis are compound data items, 
each compound data item including at least some component 
data items in a fixed sequence^ and wherein the identity 
means determines the^identif ieA of a compound data item 
based on the identifier of each V:omponent data item of 
the compound data item. \ 

26. An apparatus as in oclaim 3, further 
compr is ing : \ 
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\ context associating means for making and 

maintaining a context association, for any data item in 
th syktem, between the identifier of the data item and 
at leas\ one contextual name of the data item at a 
particular location in the system; 

Wans for obtaining the identifier of a data 
item in th^ system given a contextual name for the data 
item at a particular location in the system; and 

logical copy means for associating the data 
identifier corresponding to a contextual name at a source 
location with ^contextual name at a destination location 
in the data processing system. 

27. An Apparatus as in claim 25, wherein said 
compound data itemA ar^i files and said component data 
items are segments, /^nd wherein the identity means 
determines the idenOfi^ of a file based on the 
identifier of each darta segment of the file. 

28. An apparatus as in claim 25, further 
comprising: \ 

compound copy means for copying a data item 
from a source location to >a destination location in the 
data processing system, sard compound copy means invoking 
said local existence means determine whether the data 
item is present at the destination, and to determine, 
when the date item is a compound data item, whether the 
component data items of the compound data item are 
present at the destination, and ^providing said 
destination with the data item ofily if said local 
existence means determines that the data item is not 
present at the destination, and providing said 
destination with each component data item only if said 
local existence means determines that, the component data 
item is not present at the destination. 
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\ 29* An apparatus as in any of claims 1-28, 

wherein a data item is at least one of a file, a database 
records a message, a data segment, a data block, a 
directory, and an instance an object class. 

\30. A method of identifying a data item in a 
data processing system for subsequent access to the data 
item, the method comprising the steps of: 

determining a substantially unique identifier 
for the data Item, said identifier depending on all of 
the data in thk data item and on the data in the data 

item; and \ 

accessing a data item in the system using the 
identifier of theNdata item. 

31. A method as in claim 30, further 
comprising the step QtfpMj 

making and/ ftaiptaining, for a plurality of data 
items in the system, ah^ association between each of the 
data items and the identifier of each of the data items, 
wherein said accessing step accesses a data item via the 
association. \ 

32. A method as\ in claim 31, further 
comprising the step of: \ 

assimilating a new data item into the system, 
by determining the identifier of the new data item and 
associating the new data item with its identifier. 

33. A method for duplicating a given data item 
from a source location to a destination location in a 
data processing system, the method comprising the steps 
of: \ 

determining a substantially unique identifier 
for the given data item, said identifier depending on all 
of the data in the data item and only on the data in the 
data item; \ 
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\ determining, using said data identifier, 

whether said data item is present at said destination 
location ; and 

\ based on said determining, providing said 
destination location with said data item only if said 
data item y s n °t present at said destination. 

3^. A method as in claim 33, wherein said 
given data ittem is a compound data item having a 
plurality of component data items, the method further 
comprising the^teps of:/ 

for each data item of said component data 
items , \ 

\ obtaining the component data 
identifier of the data item by determining a 
substantially unique identifier for the data 
item, saicAidentif ier depending on all of the 
data in ttle^^ta item and only on the data in 
the data item; 

determining, using said obtained 
component data\ identifier , whether said data 
item is presentXat said destination; and 

based pn said determining, providing 
said destination with said data item only if 
said data item isVot present at said 
destination. \ 

35, A method for determining whether a 
particular data item is present un a data processing 
system, the method comprising thA steps of: 

(A) for e§ch data item\pf a plurality of data 
items in the system^ \ 

(i) determining a substantially unique 
identifier for the data itkm, said identifier 
depending on all of the data in the data item 
and^only on the data in the data item; and 
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\ (ii) making and maintaining a set of 

\ identifiers of said plurality of data items; 
\ and 

\ (B) for the particular data item, 
\ (i) determining a particular 

\ substantially unique identifier for the data 
Vtem, said identifier depending on all of the 
dWta in the data item and only on the data in 
tire data item; and 

\ (ii) determining whether said particular 
identifier is in said set of data items. 

36. AWethod of backing up, of a plurality of 
data items, data Vtems modified since a previous backup 
time in a data processing system, the method comprising 
the steps of: Y\ 

(A) maintaining a backup record of identifiers 
of data itetos backed up at the previous backup 
time; and \ 

(B) for eacli of said plurality of. data items, 

(i) determining a substantially unique 
identifies for the data item, said 
identif ier\depending on all of the data in 
the data item and only on the data in the 
data item; \ 

(ii) determining those data items of the 
plurality of data items whose identifiers 
are not in the\backup record; and 

(iii) based on said determining, copying 
only those data Vtems whose data 
identities are noV recorded in the backup 
record. \ 

37. A method as in claim\36, further 
comprising the step of: \ 
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recording in the backup record the identifiers 
of thoVe data items copied in said step of copying. 
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\ 39, The method of claim 38, further comprising 

th s\eps of : 

\ (a) for each data item of a plurality of data 
\ items at said provider locations, 
\ making and maintaining an association 

\ between the data item and the identifier 
\ of the data item, 

(b\ in response to said notifying, said client 
location copying said data item from one of 
saidAresponding remote locations, using said 
association to access the data item given the 
data ioentifier. 

40. A method of locating a particular data 
item among a plurality of locations, each of said 
locations having a plurality of data items, the method 
comprising the steps of^ 

determining/ \r or the particular data item and 
for each data item ofv^tk^/plurality of data items, a 
substantially unique id^atif ier for the data item, said 
identifier depending on aul of the data in the data item 
and only on the data in tnte data item; and 

determining the presence of the particular data 
item in each of said plurality of locations by 
determining whether the identifier of the particular data 
item is present at each of said locations. 

41. The method of claim 30, wherein said step 
of accessing further comprises tne steps of, for a given 
data identifier and for a given d^rrent location and a 
remote location in *he system: \ 

determining whether the <*ata item corresponding 
to the given data identifier is present at the current 
location, and \ 

based on said determining, \t said data item is 
not present at the current location, fetching the data 
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itemXfrom a remote location in the system to the current 
location. 

\ 42. The method of claim 41, further comprising 

the stefts of : 

\ for each contextual name at a location, 
\ making and maintaining a context 
association between the context name of a data item and 
the identifier of said data item, and when some context 
association drtianges at said current location, and 

\ notifying -said remote location of a 
modification tb the context association. 

43. TSfoe method of claim 42, further comprising 
the step of: \ 

at said\ remote location, updating the 
association betwe^fi>khe contextual identifier of the data 
item and the iddhtMier of the data item. 

44. ThcTTufthod of claim 43, further comprising 
the step of: \ 

from said remote location, notifying all other 
locations that said datta item has been modified, by 
providing the contextual identifier and data identifier 
of said data item to saidk other locations. 

45. The method af claim 44, further comprising 
the step of, at each location notified that the data item 
has been modified: \ 

modifying an association between the contextual 
identifier of the da£a item arid the data identifier of 
the data item, to record that the data item has been 
modified. \ 

46. A method of eliminating a data item at a 
given location in a data processing system when said data 



90 



item can be obtained from another location in the system, 
the method comprising the steps of: 

determining a substantially unique identifier 
for the data, said identifier depending on all of the 
data in the data item and only on the data in the data 
item; 

making and maintaining a source association 
between the data identifier and at least one location at 
which said data item is known to be present; and 

based on said source association, if said data 
item is present at said other location, removing the data 
item from the given location. 

47. A method of deleting a data item from a 
location in a data processing system, the method 
comprising the steps of: 

for each of a plurality of data items in the 

system: 

determining a substantially unique identifier 
for the data, said identifier depending on all of the 
data in the data item and only on the data in the data 
item; and 

making and maintaining, an association between 
each of the data items and the unique identifier of the 
data items; and 

for a given data item: 

determining a substantially unique identifier 
for the data, said identifier depending on all of the 
data in the data item and only on the data in the data 
item; and 

determining whether a contextual identifier or 
a compound data ite» or a remote processor in the system 
refers to the unique identifier of the data item, and 
based on said determining,. dele^ng--3aid data item and 
its association if no other contextual identifier or 
compound data item or remote processor refers to said 
data item. 



48. The method of claim 47, wherein said 
determining is based on a use count for the data item, 
and wherein said data item is deleted only if said use 
count indicates that no other contextual identifier or 
compound data item or remote processor in the system 
refers to the data item. 

49. A method of substantially synchronizing 
data items at a client location in a data processing, 
system after a period of independent changes on the 
client and another location in the system, given a 
context, the method comprising the steps of: 

making and maintaining a list of changes to the 
context association between each context name of a data 
item and the identifier of said data item, in the given 
context and during the period of independent change; 

obtaining the list of changes from the other 
location for the given context; and, 

for each context name in the list of changes 
updating the context identifier 
associations at the client whenever it is determined that 
the context association of the given context name changed 
either only at the client or only at the other location 
during the period if independent changes; and 

performing a conflict-resolution task such 
as notifying an operator of the client location, whenever 
it is determined that the context association changed at 
both the client and the other location. 

50^ A method as in claim 49, wherein said 
lists are maintained*, as queues based on a temporal order, 
and wherein, at, said client location, said replacing is 
based on said tefi^oral order. 

51. A method of maintaining at least a 
predetermined number\pf copies of a given data item in a 
data processing systesL at different locations in the 
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dlta processing system, said data processing system being 
onto wherein data is identified by a substantially unique 
identifier, said identifier depending on all of the data 
in \he data item and only on the data in the data item, 
and Vherein any data item in the system may be accessed 
usinAonly the identifier of the data item, the method 
comprising the steps of: 

(i) sending, from a first location in the 
system, the data identifier of the given data 
item to other locations in the system; and 

(ii) in response to said sending, at each of 
said other locations, 

i) determining whether the data item 
corresponding to the data identifier is present 
at \the other location, and based on said 
determining, and 

(B) Xinforming said first location whether said 
data Mearxis present at the other location; and 

(iii) $n response to said informing from said 
other /Locations, at said first location, 

(A) dafcgpdining whether said data item is 
present in at least the predetermined number of 
other locations, and based on said determining, 

(B) when less than the predetermined number of 
other locations have a copy of the data item, 
requesting some locations that do not have a 
copy of the data item make a copy of the data 
iten. 



52. A method as \in claim 51, wherein said step 
(iii) further comprises the^tep of: 

(C) when^more thaA the predetermined number of 
other locations have a copy on the data item, requesting 
some locations that do have a ctopy of the data item 
delete the copy of the data iter 
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A method as in any of claims 30-52, 
wherein said\da^ta items are at least one of a file, a 
database recAAd, a message, a data segment, a data block, 
a directory, ai 



an instance of an object class. 
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